End-to-End Video Question-Answer Generation With Generator-Pretester Network

نویسندگان

چکیده

We study a novel task, Video Question-Answer Generation (VQAG), for challenging Question Answering (Video QA) task in multimedia. Due to expensive data annotation costs, many widely used, large-scale QA datasets such as Video-QA, MSVD-QA and MSRVTT-QA are automatically annotated using Caption (CapQG) which inputs captions instead of the video itself. As neither fully represent video, nor they always practically available, it is crucial generate question-answer pairs based on via (VQAG). Existing video-to-text (V2T) approaches, despite taking input, only question alone. In this work, we propose model Generator-Pretester Network that focuses two components: (1) The Joint Generator (JQAG) generates with its corresponding answer allow “Answering” training. (2) Pretester (PT) verifies generated by trying checks pretested both model’s proposed ground truth answer. evaluate our system available human-annotated achieves state-of-the-art generation performances. Furthermore, can surpass some supervised baselines. pre-training strategy, outperform CapQG transfer learning approaches when employing semi-supervised (20%) or data. These experimental results suggest perspectives

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Chinese Medical Question Answer Matching Using End-to-End Character-Level Multi-Scale CNNs

This paper focuses mainly on the problem of Chinese medical question answer matching, which is arguably more challenging than open-domain question answer matching in English due to the combination of its domain-restricted nature and the language-specific features of Chinese. We present an end-to-end character-level multi-scale convolutional neural framework in which character embeddings instead...

متن کامل

Exploring the Effectiveness of Convolutional Neural Networks for Answer Selection in End-to-End Question Answering

Most work on natural language question answering today focuses on answer selection: given a candidate list of sentences, determine which contains the answer. Although important, answer selection is only one stage in a standard end-to-end question answering pipeline. is paper explores the eectiveness of convolutional neural networks (CNNs) for answer selection in an end-to-end context using th...

متن کامل

Comparison of nerve repair with end to end, end to side with window and end to side without window methods in lower extremity of rat

Abstract Background : Although, different studies on end-to-side nerve repair, results are controversial. The importance of this method in case is unavailability of proximal nerve. In this method, donor nerves also remain intact and without injury. In compare to other classic procedures, end-to-side repair is not much time consuming and needs less dissection. Overall, the previous studies i...

متن کامل

Amathematical theory of compressed video buffering: Traffic regulation for end-to-end video network QoS

The recent successes of over-the-top (OTT) video services have intensified the competition between the traditional broadcasting video and OTT video. Such competition has pushed the traditional video service providers to accelerate the transition of their video services from the broadcasting video to the carrier-grade IP video streaming. However, there are significant challenges in providing lar...

متن کامل

End-to-end esophagojejunostomy versus standard end-to-side esophagojejunostomy: which one is preferable?

Abstract Background: End-to-side esophagojejunostomy has almost always been associated with some degree of dysphagia. To overcome this complication we decided to perform an end-to-end anastomosis and compare it with end-to-side Roux-en-Y esophagojejunostomy. Methods: In this prospective study, between 1998 and 2005, 71 patients with a diagnosis of gastric adenocarcinoma underwent total gastrec...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE Transactions on Circuits and Systems for Video Technology

سال: 2021

ISSN: ['1051-8215', '1558-2205']

DOI: https://doi.org/10.1109/tcsvt.2021.3051277